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We introduce a measure of the compatibility between quantum states — the likelihood that two 
density matrices describe the same object. Our measure is motivated by two elementary require- 
ments, which lead to a natural definition. We list some properties of this measure, and discuss its 
relation to the problem of combining two observers' states of knowledge. 
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The quantum superposition principle induces a qual- 
itative difference between classical and quantum states 
of knowledge. The state of a quantum system can be 
fully specified, yet not predict with certainty the out- 
come of a measurement — a state of affairs which has only 
the observers' ignorance as classical analogue. In quan- 
tum mechanics, incomplete knowledge is represented by 
a mixed density matrix, which corresponds imperfectly 
to a classical distribution; the "quantum uncertainty" of 
pure states combines with the "classical uncertainty" of a 
distribution to yield an object which can be represented 
by different decompositions or preparations. 

The fidelity of two quantum states Q pa and ps , 



F{pa,Pb) = Tr <^ J ^fpApB^fpA 



(1) 



(or more precisely F 2 ) measures the likelihood that vari- 
ous measurements made on the two states will obtain the 
same result. Thus, fidelity is a measure of similarity be- 
tween states which does not distinguish between classical 
and quantum uncertainty. 

In this letter, we introduce compatibility, a measure 
similar to fidelity, but which compares two observers' 
states of knowledge, not the results of the measurements 
which they could do. We want the compatibility to mea- 
sure classical admixture, while treating different pure 
states as fundamentally different: if two observers claim 
to have complete knowledge of a system, their descrip- 
tions had better agree completely. Hence, a compatibil- 
ity measure C(pa,Pb) should satisfy the two following 
requirements: 

1. When [pA, Pb] = (classical mixture) the compat- 
ibility should be equal to the fidelity. 

2. The compatibility of incompatible states should 
be 0. 

While our first requirement should be transparent, the 
second sounds tautological, and requires further explana- 
tion. Consider two observers (Alice and Bob) whose re- 
spective states of knowledge are described by pa and ps- 



(Throughout this letter, we use subscript k to designate 
either A or B.) Brun, Finkelstein, and Mermin defined 
Alice's and Bob's descriptions to be compatible if and 
only if they could be describing the same physical sys- 
tem. They then addressed the following question: under 
what conditions are pa and pb compatible? Their answer 
is quite simple: pa and pb are compatible if and only if 
the intersection of their supports, S — S(pa) H S(pb), 
is nonempty. The support S(p) of a density matrix p 
is the complement of its null space Af(p); to obtain the 
projector Ps( P ) onto S(p), diagonalize p and replace each 
nonzero eigenvalue with 1. Thus, pa and ps are compat- 
ible if Ps(p A )Ps(p B ) h as a t least one unit eigenvalue. In 
other words, two states of knowledge are incompatible if 
between them they rule out all possible pure states. 

With this definition, state pa — |0)(0| is compatible 
with both state p B = e|0)(0| + (l -e)|l)(l| and state p' B = 
(1 - e)|0)(0| + e|l)(l| as long as < e < 1. Nevertheless 
as e — * 0, it is clear that the compatibility of pa and ps 
should vanish while that of pa and p' B should approach 
unity. The definition of [0] makes no distinction between 
these two cases and this is what originally motivated the 
present work. 

Now that requirement 2 has been clarified, we can pro- 
ceed with the definition of the compatibility measure. 

Definition 1 Let <8o(7i) be the set of all density matri- 
ces on Hilbert space H. For p € 2$o(W), define V{p) as 
the set of preparations of p: 



V(p) = \P: f P(a)ada = p\ 

{ J<B„(H) J 



(2) 



where the P are probability distributions over 58o(7i). 
Then, the compatibility of pa and ps S ( So(Ti.) is 
defined as 

C(pa,Pb)= max / ^P A {a)P B {o-)da, (3) 



P B ev(p B ) 
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the integral representing the classical fidelity F(Pa, Pb) 
(or statistical overlap) of two classical distributions Pa 
and Pb ■ 

Lemma 1 All distributions P € V{p) must vanish out- 
side 58o(p): the set of density matrices with support re- 
stricted to S{p) (this is a slightly abusive notation). 
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Proof Let P(a) be a preparation of p. We can separate 
p in two parts: 



p = / P k {a)ada+[ P k {a)a da (4) 
= W '+(1- P y (5) 



where p', by definition, has support on Af(p), p" has 
support strictly on <S(p) and p = fs£±r p \ Pk(o~) da. Hp ^ 
0, there exists ip S A/"(/o) such that p does not annihilate 
This contradicts the definition of M(p), so we conclude 
that p = and therefore P is restricted to 2$o (p) . 

Theorem 1 Definition 1 satisfies both of our require- 
ments. 

Proof 

1. If pa commutes with ps, then they have orthogonal 
decompositions onto the same set of pure states: pa — 
X^ai|</>i)(0i|; p B = X)i&i|<A»X0»l- Thus C{p Ai Pb) > 
J2i V a ih = F(pa,Pb)- Later (see P4) we show that 
C(pa,Pb) < F{pa, Pb)', therefore for commuting density 
matrices C(p A , Pb) = F(p A , Pb)- 

2. If pa and pb are incompatible, their supports are 
disjoint, which implies that -Pa I 17 ) and Pb(°~) are re- 
stricted to disjoint sets — implying that C(pa,Pb) = 0. 

Note that this measure is not the only on which satis- 
fies our two requirements. For example, define 

D n (p A , PB )=Tr{ [{ PA ) 1/2n (PB ) 1/n {p A ) l/2n \ " } ■ (6) 

Clearly, D n (p a ,p B ) = F(p a ,Pb) when n = 1 or for any 
n when [pa, Pb] = 0, so it satisfy our first requirement. 
For the second requirement, notice that 

lim D n (p A ,p B ) < lim Tr{[F s( )]"} (7) 

which is if Ps(p B )Ps(p B ) has no unit eigenvalue, i.e. 
if pa and ps are incompatible. Therefore D(pa, Pb) = 
linin^oo D n (pA, Pb) is a valid measure of compatibility. 
Definition [l] can also be generalized to 



E a (pA, Pb) 



max 

P A £V(p A ) 
Pb^V(pb) 



[P A {<r)]*\PB{<T)?- a d0 



(8) 



< a < 1 which is the Renyi overlap of Pa and Pb, 
the fidelity corresponding to the special case a = 1/2. 
This definition allows for an asymmetry between Alice 
and Bob which can be useful when one of the participant 
is more trustworthy than the other. 

Although these alternative definitions offers some in- 
teresting features, we shall concentrate on Definition |l| in 
the following. (Superscript D and E indicates that the 
results also hold for measure D(pa, Pb) and E a (pA, Pb) 
respectively, the proofs are given for C(pa,Pb) only.) 



Theorem 2 E To compute the compatibility of two 
states, it is sufficient to maximize over pure state prepa- 
rations. In other words 

C{pa,Pb)= max / v / Q^W0Qs(V0# (9) 
Qa£Q(pa) J<BUn) 
Qb£Q(pb) 

where Q3o(7i) is the set of all pure states in Ji and Q(p) 
is the set of pure state preparations of p: 



Q{p) = \Q- / Q(i>)\i>Md^ = P>, (io) 
{ J< s>im J 

Q are probabilities distributions on Q3j(7i). 

Proof Choose a standard preparation for a € Q5oCW), a — 
/^(VOIV'XV'W ( e -g-i eigendecomposition) . Then 



L 



93o(W) 



V 'P A {a)P B {a) da 



(11) 



<8 (n) J<&l{n) 



y/PA(<T)UMPB(<r)UW da d*P 



- / vqawqbw rfv 



since fidelity can only increase under the marginalization 
QkW = J<B o{n) Pk{o)fa{ip) da. 

Theorem 3 When one of the two states is pure (say 
Pb), C(pa, Pb) = ^/p where p is given by 



p = mm <q: det {p A - qpB} = } (12) 

if Pb lies within S(pa) and p = otherwise. 

Proof There is a unique preparation for ps'- Pb{&) — 
5 (a — ps). The maximum value of q for which we can 
write pa = qpB + (1— q)a (with a a valid density matrix) 
is p. The result follows. 

Theorem 4 E Any local maximum of F(Pa, Pb) over 
P{pa) ®V{pb) is a global maximum. 

Proof Fidelity is a concave function: F(XPa+ 
[1 - \}P' a ,Pb) > \F(P a ,Pb) + [1 - \]F(P' a ,Pb). The 
sets V(pa) and V(pb) are convex: any convex combina- 
tions of valid probability distributions of mean p is also 
a valid probability distribution with mean p. The result 
follows automatically. 

We now give a list of properties of the compatibility 
measure. 

P1 D C(patPb) is symmetric. 

P2 DE Compatibility is invariant under unitary transfor- 
mation: C{U PA U\Up B U^) = C( Pa ,Pb)- 

P3 DE For pure states C(iPa,iPb) = 1 if and only if 
| (iPa^b) | 2 = 1 and otherwise. 
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P4 D (Upper bound) C(p A ,Pc) < F(p A ,p B ). 

P5 DE F(p A , PB )=0^ C{p A ,p B ) = and F(p A , p B ) = 
1 & C(p A ,p B ) = 1 <=> p A = Pb- 

P6 (Lower bound) C(p a ,pb) > r^/e A €B where Ck is the 
greatest value of q for which one can write pk = -Ps + 
(1 — q)a with <7 being a valid density matrix, see (eq. 12), 
and r — Tr{P$} is the dimension of S = S(p A ) n S(ps)- 
(For compatible states, tk > A°, the smallest nonzero 
eigenvalue of P k-) 

P7 E (Multiplicativity) C(p A ® p' A , p B ® Ps) > 
C(p A ,p B )C(p' A ,p' B ). 

Proofs 

PI, P2, and P3 are straightforward from Definition |l|. 

P4: Assume that Qfc(V>) are the optimal distributions 
given by Theorem g. We choose x € lR( 2Ar_2 ' (where AT 
is the dimension of TL) as a parameterization for !Bj(7i): 
ip = iJj(x), and construct the purifications 



I** 



VQMx)) \ip(x)) ® |f) (13) 



where x is now treated as a quantum continuous vari- 
able (x\x') — S(x — x') (e.g. position of a particle in 
a TV-dimensional box). Then C(p A ,p B ) = < 
F(Pa-i Pb) since the fidelity is the maximum of this quan- 
tity over all purifications. 

This proof introduces an interesting distinction be- 
tween fidelity and compatibility. Fidelity is the optimal 
inner product between all purifications of p A and p B . On 
the other hand, compatibility involves purifications of a 
very special kind (eq.l^). All that is needed to transform 
compatibility into fidelity is to replace (eq.O) by 



= / y /Q A y>(g)) \4>(2)) ® U A \x) dx 
for arbitrary unitary operators U A and C/s. 



(14) 



P5 follows from F(p A ,ps) = 1 Pa = Ps 7 requirement 
1, and P4. 

P6: We can choose a distribution where has probabil- 
ity rek at the point a = Ps/f- 

P7: The product of the optimal distributions for 
C(p A , p B ) and C(p' A , p' B ) are valid distributions over the 
combined Hilbert space but might not be optimal. We do 
not know if this inequality can be reduced to an equality. 
In other words, it is possible that the optimal distribution 
for pa ® p'a and Pb <8 p' B involve non product states. 

It is worth mentioning that no smooth function of 
the compatibility satisfying /(C) = 1 C — and 
/(C) = C = 1 can be used to build a metric 
on Q3o(W). This is best illustrated by the following 2- 
dimensional example. Assume states p + and p_ are 
pure, derived from \ip±) = cose|0) ±sine|l), and po — 
(1 — e)|0)(0| + e|l)(l| where e — > 0. One can easily verify 



thatC(p+,p_) = 0andC(p+,p o ) = C(p_,p ) = l-O(e) 
so f(C( P+ ,p 7 )) = 1 > f(C(p + ,p )) + f(C(p r ,p )) r 
as e — > 0. This is in contrast with classical distributions: 
when [pAjPs] = 0, cos -1 F{^fp A p B ) is a valid distance 
measure [[|. 

Measurement — Suppose that Alice and Bob acquire their 
knowledge of p A and p B through measurement. These 
states will always be compatible: incompatible knowl- 
edge acquired through measurement would indicate an 
inconsistency in quantum theory 1 . For example, they can 
each be given many copies of a quantum system in state 
p of which they initially have no knowledge except the 
dimension. They carry out independent measurements 
on those copies and, with the help of Bayesian rules, up- 
date their description of the system (see and refer- 
ences therein). As mentioned earlier, their descriptions 
will always be compatible. Nevertheless, a low compati- 
bility could result as a consequence of one of the following 
situations: i) they were given copies of different states; 
ii) their measurement apparatus are miscalibrated; or 
Hi) they are in a very improbable branch of the Universe. 

These eventualities cannot be detected by the fidelity 
of p A and p B . For example, suppose that, for a 2-level 
system, 



PA 

PB = 



= (1 
(1- 



e)|0)(0| 



111, 



(15) 



where |+) = -^(|0) + |1)). As the observers' knowledge 

becomes more and more accurate (e — > 0), the compat- 
ibility goes to 0, indicating one of the three situations 
listed above. On the other hand, fidelity saturates at 
F 2 = 1/2, which is the same as if both Alice and Bob 
had a vague knowledge of the state, e.g. 



PA = (h + a 
Pb = (5 - a 



+ a~a)\l)(l\ 

+ (l + «)ll)(ll 



with a = V2/4. This clearly illustrates the fact that 
fidelity makes no distinction between classical and quan- 
tum uncertainty. 

Combining knowledge — Now, suppose Alice and Bob 
want to pool their information. If C{p Ai pb) — (which 
cannot result from measurement), their "knowledge" is 
contradictory. When C(p A ,ps) > 0, however, they can 
combine their states of knowledge to get a new density 
matrix p A s ■ This issue has recently been studied by Ja- 
cobs |s| but with the only conclusion that p A s should lie 
in S(p A ) n S(p B )- 

We propose that the state obtained from combining 
two states of knowledge should be the one which is max- 
imally compatible with both of them. This requires a 



1 Incompatible knowledge could emerge as a consequence of the 
finite accuracy of the measurement apparatus: nevertheless, such 
limitations should be taken into account in the state estimation. 
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definition of three-way compatibility: 

C{p A ,PB,Pc)= (17) 
max / yP A (a)P B (a)P c (a)da. 

Pa£V{pa) JtBoCH) 
Pb^V(p b ) 

Pcev(pc) 

Hence, our rule for combining states of knowledge reads 

Pab = Argument ^max C(p A , Pb, p)^j ; (18) 

in the eventuality that the maximum over p is not unique, 
one can discriminate with a maximum entropy S(p) cri- 
teria which is well motivated in the current context. For 
any fixed Pa and Pb, the Pc that optimizes (eqjl8|) is 
proportional to the geometric average of Pa and Pb- 
Therefore, defining Pa and Pb as the distributions which 
optimized equation (|3|), we get 

Pab = / P A b{o)o da (19) 

J<8 (H) 

where Pab = VPaPb / C{pa, Pb)- Furthermore, there is 
a simple relation between the optimal three-way compat- 
ibility and the compatibility of the two original descrip- 
tions: C(pa,Pb,Pab) 3 = C(pa,Pb) 2 - 

Knowledge — Knowledge of a quantum system can take 
many forms; as Bennett expresses it, 

It is possible to know or possess a quan- 
tum state in infinitely many physically in- 
equivalent ways, ranging from complete clas- 
sical knowledge, through possession of a sin- 
gle specimen of the state, to weaker and less 
compactly embodiable forms such as the abil- 
ity to simulate the outcome of a single POVM 
measurement on the state. || 

The compatibility measurement of (eq||) is meaning- 
ful when we consider classical description of the quan- 
tum states; the quantum fidelity (eq.[j]) corresponds to a 



situation where single specimens of the quantum states 
are available (respectively "knowledge of the quantum" 
and "quantum knowledge" ) . One can define compatibil- 
ity measurements according to the type of knowledge one 
is dealing with. For example, we can define the com- 
patibility between a state p and an ensemble {qj, o~j} as 
max.p( p ) F(P, Q) where Q(cr) = J^ . qjS(a — o-j). While 
the pure state |+) = -i=(|0) + 11)) is compatible with the 
ensemble £i = {(l,p|0)(0| + (l-p)|l)(l|)}, it is incompat- 
ible with the ensemble E 2 = {(p, |0)(0|), ((1 -p), |1)(1|)}, 
even if they are preparations of the same state. 

An ensemble embodies more knowledge than its asso- 
ciated (average) state. In our prescription for combining 
knowledge, we have assumed that all of Alice's and Bob's 
knowledge was encapsulated in their respective density 
matrices. Note that all knowledge can be represented in 
this form by including ancillary systems [e.g. eq.(]l3|)]. 

Suppose, instead, that both Alice's and Bob's states 
of knowledge are represented by the ensemble Ei . Obvi- 
ously, their combined density matrix should be pabi — 
p\0)(0 1+(1 - p) 1 1)(1| . On the other hand, when both their 
states of knowledge are E 2 , Bayesian rules would suggest 
that their combined state should be pab2 — P 2 |0)(0| + 
( 1 — j>) 2 1 1 (with proper normalization) — but this as- 
sumes that their knowledge was acquired independently 
If their knowledge came from a redundant source, 
the Bayesian rule would then yield state pabi, as would 
our prescription. 

Hence, this illustrates that our rule for combining 
states of knowledge assumes no more information than 
what is encapsulated in the density matrices. Further- 
more, it can quite simply be adapted to different forms 
of knowledge, either through the use of ancillary systems 
or of generalized compatibility measures. 
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